Integrating Speech and Natural Language
نویسنده
چکیده
The overall goal of this project is to integrate speech and natural language knowledge sources to build a speech understanding system for human-machine communication using spoken English. The speech knowledge sources use acoustic models based on hidden Markov modeling techniques. The natural language knowledge sources use a Unification grammar formalism for describing the syntax of English, a higher-order intensional logic language for representing the meaning of an utterance, and a 'Montague Grammar style' framework for interfacing syntax and semantics. The objective of an integrated search strategy is to find the globally optimal (by acoustic likelihood) interpretation of the input given the constraints that are imposed by the syntactic and semantic components. Our approach in the BBN Spoken Language System (SLS) uses hidden Markov word models to determine a 'dense' word lattice (to minimize errors due to early decisions) of possible words present in the input speech. Then, a lattice parser is used to find all parses for all the syntactically possible word sequences present in the lattice (ordered by acoustic likelihood). Finally, a semantic interpreter determines from the ordered list of possible word sequences the highest scoring meaningful word sequence. That word sequence is the recognized sentence and its meaning is the interpretation of the input speech. The lattice parser is an extension of our bottom-up word-synchronous parser for the Unification grammar to accept a word lattice as input. The algorithm determines all possible grammatical word sequences and ranks them by acoustic likelihood scores. Then, that list of grammatical sentences is processed to determine the highest scoring word sequence that is also meaningful using the application domain semantics. The SLS system was evaluated using the DARPA Resource Management database. For the language modeling components (syntax and semantics), we used a training corpus of 791 sentences, which has been used for system development, and a test corpus of 200 sentences, which was unseen by the system developers. The grammar coverage was 92 % for the training corpus and 81% for the test corpus. The coverage of the semantic interpreter was 75 % and 52 % for the two corpora, respectively. We have also measured the speech understanding performance on three speakers from the 1000word Resource Management database. The word accuracy on the 1000-word task improves from 71% when no language model is used to 87 % when the syntax alone is used and to 92 % when both syntax and semantics are used.
منابع مشابه
A Comparison between Three Methods of Language Sampling: Freeplay, Narrative Speech and Conversation
Objectives: The spontaneous language sample analysis is an important part of the language assessment protocol. Language samples give us useful information about how children use language in the natural situations of daily life. The purpose of this study was to compare Conversation, Freeplay, and narrative speech in aspects of Mean Length of Utterance (MLU), Type-token ratio (TTR), and the numbe...
متن کاملIntegrating Speech and Natural-Language Processing
SRI has developed a new architecture for integrating speech and natural-language processing that applies linguistic constraints during recognition by incrementally expanding the state-transition network embodied in a unification grammar. We compare this dynamic-gralnlnar-network (DGN) approach to its principal alternative, word-lattice parsing, presenting preliminary experimental results that s...
متن کاملA Contrastive Study of Request Speech Act in English and Persian Novels: Natural Semantic Metalanguage Approach
The Natural Semantic Metalanguage (NSM) Approach claims that there are some universalities in all languages. Speech acts seem to be present in all languages, but considering this approach, research has not indicated whether request speech act differs from one language to another. Thus, this study intended to investigate whether request strategies are used differently in English and Persian roma...
متن کاملA Contrastive Study of Request Speech Act in English and Persian Novels: Natural Semantic Metalanguage Approach
The Natural Semantic Metalanguage (NSM) Approach claims that there are some universalities in all languages. Speech acts seem to be present in all languages, but considering this approach, research has not indicated whether request speech act differs from one language to another. Thus, this study intended to investigate whether request strategies are used differently in English and Persian roma...
متن کاملIntegrating Natural Language Generation with XML Web Technology
The paper describes a software demo integrating Natural Language Generation (NLG) techniques with recent developments in XML web technology. The NLG techniques include a form of template-based generation, transformation of text plan trees to text specification trees, and a multi-stage pipeline architecture. The web technology includes XSLT transformation processors, an XML database, a Java serv...
متن کاملIntegrating Natural Language Generation with the Apache Cocoon XML Server
The paper describes a software demo integrating Natural Language Generation (NLG) techniques with recent developments in XML web technology. The NLG techniques include a form of template-based generation, transformation of text plan trees to text specification trees, and a multi-stage pipeline architecture. The web technology includes XSLT transformation processors, an XML database, a Java serv...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1989